Abstract

We analyze the convergence of the spectrum of large random graphs to the spectrum 
of a limit infinite graph. We apply these results to graphs converging locally to trees and 
derive a new formula for the Stieltjes transform of the spectral measure of such graphs. We 
illustrate our results on uniform regular graphs, Erdős-Rényi graphs and graphs with a
given degree sequence. We give examples of application for weighted graphs, bipartite graphs
and the uniform spanning tree on n vertices.
MSC-class: 05C80, 15A52 (primary), 47A10 (secondary). 



1 Introduction 

Since the seminal work of Wigner [37], the spectral theory of large dimensional random matrices
has become a very active field of research, see e.g. the monographs by Mehta [26], Hiai
and Petz [21], or Bai and Silverstein [3], and for a review of applications in physics, see Guhr et al.
[20]. It is worth noticing that classical random matrix theory has left aside the dilute random
matrices (i.e. when the number of non-zero entries on each row does not grow with the size of
the matrix). In the physics literature, the analysis of dilute random matrices was initiated
by Rodgers and Bray [33]. In [7], Biroli and Monasson use heuristic arguments to analyze
the spectrum of the Laplacian of Erdős-Rényi random graphs, and an explicit connection with
their local approximation as trees is made by Semerjian and Cugliandolo in [35]. Also related
is the recent cavity approach to the spectral density of sparse symmetric random matrices by
Rogers et al. [34]. Rigorous mathematical treatments can be found in Bauer and Golinelli [4]
and Khorunzhy, Shcherbina and Vengerovsky [23] for Erdős-Rényi random graphs. In parallel,
since McKay [25], similar questions have also appeared in graph theory and combinatorics; for
a review, refer to Mohar and Woess [29]. In this paper, we present a unified treatment of these
issues, and prove under weak conditions the convergence of the empirical spectral distribution
of adjacency and Laplacian matrices of large graphs.
Our main contribution is to connect this convergence to the local weak convergence of the
sequence of graphs. There is a growing interest in the theory of convergence of graph sequences.
The convergence of dense graphs is now well understood thanks to the work of Lovász and
Szegedy [24] and a series of papers written with Borgs, Chayes, Sós and Vesztergombi (see
[11, 12] and references therein). They introduced several natural metrics for graphs and showed
that they are equivalent. However, these results are of no help in the case studied here of diluted
graphs, i.e. when the number of edges scales as the number of vertices. Many new phenomena
occur, and there are a host of plausible metrics to consider [9]. Our first main result (Theorem
1) shows that local weak convergence implies the convergence of the spectral measure. Our
second main result (Theorem 2) characterizes, in terms of its Stieltjes transform, the limit spectral
measure of a large class of random graph ensembles. The remainder of this paper is organized
as follows: in the next section, we give our main results. In Section 3 we prove Theorem 1; in
Section 4 we prove Theorem 2. Finally, in Section 5 we extend and apply our results to related
graphs.

2 Main results 

2.1 Convergence of the spectral measure of random graphs 

Let $G_n$ be a sequence of simple graphs with vertex set $[n] = \{1, \dots, n\}$ and undirected edge
set $E_n$. We denote by $A = A(G_n)$ the $n \times n$ adjacency matrix of $G_n$, in which $A_{ij} = 1$ if
$(ij) \in E_n$ and $A_{ij} = 0$ otherwise. The Laplace matrix of $G_n$ is $L(G_n) = D(G_n) - A(G_n)$, where
$D = D(G_n)$ is the degree diagonal matrix in which $D_{ii} = \deg(G_n, i) := \sum_{j \in [n]} A_{ij}$ is the degree
of $i$ in $G_n$ and $D_{ij} = 0$ for all $i \ne j$. The main object of this paper is to study the convergence of
the empirical measures of the eigenvalues of $A$ and $L$ respectively when the sequence of graphs
converges weakly as defined by Benjamini and Schramm [5] and Aldous and Steele [2] (a precise
definition is given below). Note that the spectra of $A(G_n)$ and $L(G_n)$ do not depend on the
labeling of the graph $G_n$. If we label the vertices of $G_n$ differently, then the resulting matrix
is unitarily equivalent to $A(G_n)$ (respectively $L(G_n)$), and it is well known that the spectrum is
invariant under unitary equivalence. For ease of notation, we define

$$ \Delta_n = A(G_n) - \alpha D(G_n), $$

with $\alpha \in \{0, 1\}$, so that $\Delta_n = A(G_n)$ if $\alpha = 0$ and $\Delta_n = -L(G_n)$ if $\alpha = 1$. The empirical spectral
measure of $\Delta_n$ is denoted by

$$ \mu_{\Delta_n} = \frac{1}{n} \sum_{i=1}^{n} \delta_{\lambda_i(\Delta_n)}, $$

where $(\lambda_i(\Delta_n))_{1 \le i \le n}$ are the eigenvalues of $\Delta_n$. We endow the set of measures on $\mathbb{R}$ with
the usual weak convergence topology. This convergence is metrizable with the Lévy distance

$$ L(\mu, \nu) = \inf\{h \ge 0 : \forall x \in \mathbb{R},\ \mu((-\infty, x - h]) - h \le \nu((-\infty, x]) \le \mu((-\infty, x + h]) + h\}. $$
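The definition of $\Delta_n$ can be made concrete on a small graph. The following is a minimal sketch, not code from the paper: the function name `delta_matrix` and the 0-based vertex labels are illustrative assumptions. It builds $A - \alpha D$ from an edge list, so that $\alpha = 0$ returns the adjacency matrix and $\alpha = 1$ returns minus the Laplacian.

```python
def delta_matrix(n, edges, alpha):
    """Return Delta = A - alpha * D (list of rows) for a simple graph on {0, ..., n-1}.

    Hypothetical helper for illustration; the paper labels vertices 1..n.
    """
    if alpha not in (0, 1):
        raise ValueError("alpha must be 0 or 1")
    a = [[0] * n for _ in range(n)]
    for i, j in edges:
        if i == j:
            raise ValueError("simple graphs have no loops")
        a[i][j] = 1
        a[j][i] = 1
    for i in range(n):
        degree = sum(a[i])         # D_ii = sum_j A_ij (diagonal is still 0 here)
        a[i][i] = -alpha * degree  # diagonal entry of A - alpha * D
    return a
```

For $\alpha = 1$ every row of the result sums to zero, which is the usual sanity check for a (negative) graph Laplacian.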

We now define the local weak convergence introduced by Benjamini and Schramm [5] and
Aldous and Steele [2]. For a graph $G$, we define the rooted graph $(G, o)$ as the connected
component of $G$ containing a distinguished vertex $o$ of $G$, called the root. A homomorphism
from a graph $F$ to another graph $G$ is an edge-preserving map from the vertex set of $F$ to the
vertex set of $G$. A bijective homomorphism is called an isomorphism. A rooted isomorphism
of rooted graphs is an isomorphism of the graphs that takes the root of one to the root of the
other. $[G, o]$ will denote the class of rooted graphs that are rooted-isomorphic to $(G, o)$. Let $\mathcal{G}^*$
denote the set of rooted isomorphism classes of rooted connected locally finite graphs. Define a
metric on $\mathcal{G}^*$ by letting the distance between $(G_1, o_1)$ and $(G_2, o_2)$ be $1/(a + 1)$, where $a$ is the
supremum of those $r \in \mathbb{N}$ such that there is some rooted isomorphism of the balls of radius $r$
(for the graph-distance) around the roots of $G_i$. $\mathcal{G}^*$ is a separable and complete metric space [1].
For probability measures $\rho$, $\rho_n$ on $\mathcal{G}^*$, we write $\rho_n \Rightarrow \rho$ when $\rho_n$ converges weakly with respect
to this metric.

Following [1], for a finite graph $G$, let $U(G)$ denote the distribution on $\mathcal{G}^*$ obtained by
choosing a uniform random vertex of $G$ as root. We also define $U_2(G)$ as the distribution on
$\mathcal{G}^* \times \mathcal{G}^*$ of the pair of rooted graphs $((G, o_1), (G, o_2))$ where $(o_1, o_2)$ is a uniform random pair
of vertices of $G$. If $(G_n)$, $n \in \mathbb{N}$, is a sequence of deterministic graphs with vertex set $[n]$ and
$\rho$ is a probability measure on $\mathcal{G}^*$, we say the random weak limit of $G_n$ is $\rho$ if $U(G_n) \Rightarrow \rho$. If
$(G_n)$, $n \in \mathbb{N}$, is a sequence of random graphs with vertex set $[n]$, we denote by $\mathbb{E}[\cdot] = \mathbb{E}_n[\cdot]$
the expectation with respect to the randomness of the graph $G_n$. The measure $\mathbb{E}[U(G_n)]$ is
defined as $\mathbb{E}[U(G_n)](B) = \mathbb{E}[U(G_n)(B)]$ for any measurable event $B$ on $\mathcal{G}^*$. Following Aldous
and Steele [2], we will say that the random weak limit of $G_n$ is $\rho$ if $\mathbb{E}[U(G_n)] \Rightarrow \rho$. Note
that the second definition generalizes the first one (take $\mathbb{E}_n = \delta_{G_n}$). In all cases, we denote
by $(G, o)$ a random rooted graph whose equivalence class in $\mathcal{G}^*$ has distribution $\rho$. Let
$\deg(G_n, o)$ be the degree of the root under $U(G_n)$ and $\deg(G, o)$ be the degree of the root
under $\rho$. They are random variables on $\mathbb{N}$ such that if the random weak limit of $G_n$ is $\rho$ then
$\lim_{n \to \infty} \mathbb{P}(\deg(G_n, o) \le k) = \rho(\{\deg(G, o) \le k\})$.
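The law of the root degree under $U(G)$ is simply the empirical degree distribution of $G$. The sketch below illustrates this on an edge list; the names `root_degree_law` and `tail_mean` are hypothetical and vertices are labeled from 0. The second helper computes the truncated mean $\mathbb{E}[\deg \cdot 1(\deg > \ell)]$, the kind of tail quantity that uniform integrability of the root degree is meant to control.

```python
from collections import Counter
from fractions import Fraction


def root_degree_law(n, edges):
    """Exact law of deg(G, o) when the root o is uniform on {0, ..., n-1}."""
    degree = [0] * n
    for i, j in edges:
        degree[i] += 1
        degree[j] += 1
    counts = Counter(degree)
    return {k: Fraction(c, n) for k, c in counts.items()}


def tail_mean(law, ell):
    """E[deg * 1(deg > ell)] for a law given as {degree: probability}."""
    return sum(k * p for k, p in law.items() if k > ell)
```

On a star with one center and four leaves, the root has degree 4 with probability 1/5 and degree 1 with probability 4/5.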

We will make the following assumption for the whole paper: 

A. The sequence of random variables $(\deg(G_n, o))$, $n \in \mathbb{N}$, is uniformly integrable.

Assumption (A) ensures that if the random weak limit of $G_n$ is $\rho$, then the average degree
of the root converges, namely $\lim_{n \to \infty} \mathbb{E}[\deg(G_n, o)] = \rho(\deg(G, o))$.

To prove our first main result, we will consider two assumptions, one, denoted by (D), for 
a given sequence of finite graphs and another, denoted by (R), for a sequence of random finite 
graphs. 

D. As $n$ goes to infinity, the random weak limit of $G_n$ is $\rho$.
R. As $n$ goes to infinity, $U_2(G_n) \Rightarrow \rho \otimes \rho$.

Of course, Assumption (R) implies (D). We are now ready to state our first main theorem:

Theorem 1 (i) Let $G_n = ([n], E_n)$ be a sequence of graphs satisfying assumptions (D-A),
then there exists a probability measure $\mu$ on $\mathbb{R}$ such that $\lim_{n \to \infty} \mu_{\Delta_n} = \mu$.
(ii) Let $G_n = ([n], E_n)$ be a sequence of random graphs satisfying assumptions (R-A), then
there exists a probability measure $\mu$ on $\mathbb{R}$ such that $\lim_{n \to \infty} \mathbb{E} L(\mu_{\Delta_n}, \mu) = 0$.

In (ii), note that the stated convergence implies the weak convergence of the law of $\mu_{\Delta_n}$ to
$\delta_\mu$. Theorem 1 appeared under different settings, when the sequence of maximal degrees of the
graphs $G_n$ is bounded, see Colin de Verdière [13], Serre [36] and Elek [17].

2.2 Random graphs with trees as local weak limit 

We now consider a sequence of random graphs $G_n$, $n \in \mathbb{N}$, which converges as $n$ goes to infinity
to a possibly infinite tree. In this case, we will be able to characterize the probability measure $\mu$.
Here, we restrict our attention to particular trees as limits, but some cases outside the scope of this
section are also analyzed in Section 5. A Galton-Watson Tree (GWT) with offspring distribution
$F$ is the random tree obtained by a standard Galton-Watson branching process with offspring
distribution $F$. For example, the infinite $k$-ary tree is a GWT with offspring distribution $\delta_k$, see
Figure 1. A GWT with degree distribution $F_*$ is a rooted random tree obtained by a Galton-
Watson branching process where the root has offspring distribution $F_*$ and all other genitors
have offspring distribution $F$, where for all $k \ge 1$, $F(k - 1) = k F_*(k) / \sum_j j F_*(j)$ (we assume
$\sum_k k F_*(k) < \infty$). For example, the infinite $k$-regular tree is a GWT with degree distribution $\delta_k$,
see Figure 1. It is easy to check that a GWT with degree distribution $F_*$ defines a unimodular
probability measure on $\mathcal{G}^*$ (for a definition and properties of unimodular measures, refer to [1]).
Note that if $F_*$ has a finite second moment then $F$ has a finite first moment.
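The passage from the degree distribution $F_*$ of the root to the offspring distribution $F$ of the other vertices is a size-biasing followed by a shift. A minimal sketch, assuming distributions are represented as dictionaries `{k: probability}` (a representation chosen here for illustration, as are the function names):

```python
import math


def offspring_from_degree(f_star):
    """Map a degree law F_* to F with F(k - 1) = k F_*(k) / sum_j j F_*(j)."""
    mean = sum(k * p for k, p in f_star.items())
    if mean <= 0:
        raise ValueError("F_* must have a positive mean")
    return {k - 1: k * p / mean for k, p in f_star.items() if k >= 1 and p > 0}


def poisson_pmf(c, kmax):
    """Poisson(c) probabilities on {0, ..., kmax} (truncated, for illustration)."""
    return {k: math.exp(-c) * c ** k / math.factorial(k) for k in range(kmax + 1)}
```

Two cases from the text can be checked directly: $\delta_k$ is mapped to $\delta_{k-1}$ (the $k$-regular tree), and a Poisson law is mapped to itself up to truncation error.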




Figure 1: Left: representation of a 3-ary tree. Right: representation of a 3-regular tree. 

Let $n \in \mathbb{N}$ and let $G_n = ([n], E_n)$ be a random graph on the finite vertex set $[n]$ with edge set
$E_n$. We assume that the following holds

RT. As $n$ goes to infinity, $U_2(G_n)$ converges weakly to $\rho \otimes \rho$, where $\rho$ is the probability
measure on $\mathcal{G}^*$ of a GWT with degree distribution $F_*$.

Note that we have $\lim_{n \to \infty} \mathbb{E}[\deg(G_n, o)] = \sum_k k F_*(k) < \infty$ and

$$ \lim_{n \to \infty} \mathbb{E}[\deg(G_n, v) \mid (o, v) \in E_n] = 1 + \sum_k k F(k) = 1 + \frac{\sum_k k(k - 1) F_*(k)}{\sum_k k F_*(k)}. $$

Under assumption (A), this last quantity might be infinite. To prove our next result, we need
to strengthen it into

A'. Assumption (A) holds and $\sum_k k^2 F_*(k) < \infty$.

We mention three important classes of graphs which converge locally to a tree and which 
satisfy our assumptions. 

Example 1 Uniform regular graph. The uniform $k$-regular graph on $n$ vertices satisfies these
assumptions with the infinite $k$-regular tree as local limit. This follows easily, for example, from
Bollobás [8]; see also the survey of Wormald [38].

Example 2 Erdős-Rényi graph. Similarly, consider the Erdős-Rényi graph on $n$ vertices where
there is an edge between two vertices with probability $p/n$ independently of everything else. This
sequence of random graphs satisfies the assumptions with limiting tree the GWT with degree
distribution $\mathrm{Poi}(p)$.

Example 3 Graphs with asymptotic given degree. More generally, the usual random graph
(called the configuration model) with asymptotic degree distribution $F_*$ satisfies this set of
hypotheses provided that $\sum_k k(k - 1) F_*(k) < \infty$ (e.g. see Chapter 3 in Durrett [15] and Molloy and
Reed [30]).




In these three cases, it is easy to see that if $G_n$ denotes the random graph on $[n]$ and $G$ is
a GWT with degree distribution $F_*$, then we have convergence of $G_n$ to $G$ in probability, which
implies Assumption (RT).
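As a small illustration of the Erdős-Rényi example, the sketch below samples a graph with edge probability $p/n$ and reports its mean degree, which should be close to $p$, the mean of the limiting $\mathrm{Poi}(p)$ degree distribution. The function names and the `seed` argument are assumptions made for this example; the quadratic loop over vertex pairs is only meant for small $n$.

```python
import random


def erdos_renyi_edges(n, p, seed=0):
    """Sample the edge list of an Erdos-Renyi graph with edge probability p / n."""
    rng = random.Random(seed)
    q = p / n
    return [(i, j) for i in range(n) for j in range(i + 1, n) if rng.random() < q]


def mean_degree(n, edges):
    """Average degree: each edge contributes to two vertices."""
    return 2 * len(edges) / n
```

The mean degree concentrates around $p(n-1)/n$, so for a few hundred vertices it is already within a few tenths of $p$ with overwhelming probability.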

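Recall that $\Delta_n = A(G_n) - \alpha D(G_n)$, with $\alpha \in \{0, 1\}$. We now introduce a standard tool
of random matrix theory used to describe the empirical spectral measure $\mu_{\Delta_n}$ (see Bai and
Silverstein [3] for more details). Let $R_n(z) = (\Delta_n - z I_n)^{-1}$ be the resolvent of $\Delta_n$. We denote
$\mathbb{C}_+ = \{z \in \mathbb{C} : \Im z > 0\}$. Let $\mathcal{H}$ be the set of holomorphic functions $f$ from $\mathbb{C}_+$ to $\mathbb{C}_+$ such that
$|f(z)| \le \frac{1}{\Im z}$. For all $i \in [n]$, the mapping $z \mapsto R_n(z)_{ii}$ is in $\mathcal{H}$ (see Section 3.1). We denote by
$\mathcal{P}(\mathcal{H})$ the space of probability measures on $\mathcal{H}$. The Stieltjes transform of the empirical spectral
distribution is given by:

$$ m_n(z) = \int_{\mathbb{R}} \frac{1}{x - z} \, d\mu_{\Delta_n}(x) = \frac{1}{n} \mathrm{tr}\, R_n(z) = \frac{1}{n} \sum_{i=1}^{n} R_n(z)_{ii}, \tag{1} $$

where $z \in \mathbb{C}_+$. Our second main result is the following.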
Theorem 2 Under assumptions (RT-A'),

(i) There exists a unique probability measure $Q \in \mathcal{P}(\mathcal{H})$ such that for all $z \in \mathbb{C}_+$,

$$ Y(z) \stackrel{d}{=} -\Big( z + \alpha(N + 1) + \sum_{i=1}^{N} Y_i(z) \Big)^{-1}, \tag{2} $$

where $N$ has distribution $F$ and $Y$ and $Y_i$ are iid copies independent of $N$ with law $Q$.

(ii) For all $z \in \mathbb{C}_+$, $m_n(z)$ converges as $n$ tends to infinity in $L^1$ to $\mathbb{E} X(z)$, where for all
$z \in \mathbb{C}_+$,

$$ X(z) = -\Big( z + \alpha N_* + \sum_{i=1}^{N_*} Y_i(z) \Big)^{-1}, \tag{3} $$

where $N_*$ has distribution $F_*$ and $Y_i$ are iid copies with law $Q$, independent of $N_*$.

Equation (2) is a Recursive Distributional Equation (RDE). In random matrix theory, the
Stieltjes transform appears classically as a fixed point of a mapping on $\mathcal{H}$. For example, in the
Wigner case [37] (i.e. the matrix $W_n = (A_{ij}/\sqrt{n})_{1 \le i,j \le n}$ where $A_{ij} = A_{ji}$ are iid copies of $A$
with $\mathrm{var}(A) = \sigma^2$), the Stieltjes transform $m(z)$ of the limiting spectral measure satisfies for all
$z \in \mathbb{C}_+$,

$$ m(z) = -\big( z + \sigma^2 m(z) \big)^{-1}. \tag{4} $$

It is then easy to show that the limiting spectral measure is the semi-circular law with radius
$2\sigma$.
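The fixed-point equation for the Wigner case can be checked numerically. The sketch below is an illustration under stated assumptions, not part of the paper's argument: it iterates the map $m \mapsto -(z + \sigma^2 m)^{-1}$ from an arbitrary starting point in $\mathbb{C}_+$ and compares the result with the root of $\sigma^2 m^2 + z m + 1 = 0$ that has positive imaginary part, which is the Stieltjes transform of the semi-circular law of radius $2\sigma$. The function names and the iteration count are illustrative choices.

```python
import cmath


def wigner_fixed_point(z, sigma=1.0, iterations=500):
    """Iterate m -> -1 / (z + sigma^2 m), starting in the upper half-plane."""
    m = 1j
    for _ in range(iterations):
        m = -1.0 / (z + sigma ** 2 * m)
    return m


def semicircle_stieltjes(z, sigma=1.0):
    """Root of sigma^2 m^2 + z m + 1 = 0 with positive imaginary part."""
    root = cmath.sqrt(z * z - 4.0 * sigma ** 2)
    candidates = [(-z + root) / (2.0 * sigma ** 2), (-z - root) / (2.0 * sigma ** 2)]
    return max(candidates, key=lambda m: m.imag)
```

The iteration is a Möbius map with one attracting fixed point in $\mathbb{C}_+$, so it converges geometrically; convergence slows down as $\Im z$ approaches 0 inside the support.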

We explain now why the situation is different in our case. Let $X_n(z) = R_n(z)_{oo} \in \mathcal{H}$ be
the diagonal term of the resolvent matrix corresponding to the root of the graph $G_n$. From (1),
we see that $m_n(z) = U(G_n)(X_n(z))$. In a first step, we will prove that the random variable
$X_n(z)$ under $U(G_n)$ converges in distribution to $X(z)$ given by (3). Then we finish the proof
of Theorem 2 with the following steps: $\lim_n m_n(z) = \lim_n \mathbb{E}[U(G_n)(X_n(z))] = \mathbb{E}[X(z)]$. For the
proof of the convergence of $X_n(z)$, we first consider the case of uniformly bounded degrees
in $G_n$, i.e. $\max_i \deg(G_n, i) \le \ell$. In this case, Mohar proved in [28] that it is possible to define the
resolvent $R$ of the possibly infinite graph $(G, o)$ with law $\rho$. We then show that $X_n(z)$ converges
weakly to $R(z)_{oo}$ under $\rho$. Thanks to the tree structure of $(G, o)$, we can characterize the law
of $R(z)_{oo}$ with the RDE (2). Recall that the root has offspring distribution
$F_*$ whereas all other nodes have offspring distribution $F$, which explains formulas (3) and (2)
respectively. A similar approach was used in [10] for the spectrum of large random reversible
Markov chains with heavy-tailed weights, where a more intricate tree structure appears.

Note that for all $z \in \mathbb{C}_+$, $m_n(z)$ and $X(z)$ are bounded by $(\Im z)^{-1}$, hence the convergence
in Theorem 2 (ii) of $m_n(z)$ to $\mathbb{E} X(z)$ holds in $L^p$ for all $p \ge 1$. Under the restrictive assumption
$\max_i \deg(G_n, i) \le \ell$, we are able to prove that on $\mathcal{H}$, $X_n$ converges weakly to $X$, but we do
not know if this convergence holds in general. We also need Assumption (A') to prove the
uniqueness of the solution in (2), even though we know from Theorem 1 that the empirical
spectral distribution converges under the weaker Assumption (A).

We end this section with two examples that appeared in the literature. 

Example 1 If $G_n$ is the uniform $k$-regular graph on $[n]$, with $k \ge 2$, then $G_n$ converges to
the GWT with degree distribution $\delta_k$. We consider the case $\alpha = 0$. Looking for deterministic
solutions $Y$ of (2), we find $Y(z) = -(z + (k - 1) Y(z))^{-1}$, hence, in view of (4), $Y$ is simply the
Stieltjes transform of the semi-circular law with radius $2\sqrt{k - 1}$. For $X(z)$ we obtain

$$ X(z) = -(z + k Y(z))^{-1} = -\frac{2(k - 1)}{(k - 2) z + k \sqrt{z^2 - 4(k - 1)}}. \tag{5} $$

In particular $\Im X(z) = \Im(z + k Y(z)) / |z + k Y(z)|^2$. Using the formula
$\mu[a, b] = \lim_{v \to 0^+} \frac{1}{\pi} \int_a^b \Im X(x + iv) \, dx$, valid for all continuity points $a < b$ of $\mu$, we deduce
easily that $\mu_{\Delta_n}$ converges weakly to the probability measure $\mu(dx) = f(x) dx$ which has a
density $f$ on $[-2\sqrt{k - 1}, 2\sqrt{k - 1}]$ given by

$$ f(x) = \frac{k}{2\pi} \, \frac{\sqrt{4(k - 1) - x^2}}{k^2 - x^2}, $$

and $f(x) = 0$ if $x \notin [-2\sqrt{k - 1}, 2\sqrt{k - 1}]$. This formula for the density of the spectral measure
is due to McKay [25] and Kesten [22] in the context of simple random walks on groups. To the
best of our knowledge, the proof of Theorem 2 is the first proof using the resolvent method of
McKay's Theorem. It is interesting to notice that this measure and the semi-circle distribution
are simply related by their Stieltjes transforms, see (5).
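The relation between the resolvent at the root of the $k$-regular tree and the density $f$ can be checked numerically. This is a hedged sketch with illustrative function names: `regular_tree_root_resolvent` solves $(k-1)Y^2 + zY + 1 = 0$ for the root with positive imaginary part and returns $X(z) = -(z + kY(z))^{-1}$, while `total_mass` integrates the density with a simple midpoint rule.

```python
import cmath
import math


def regular_tree_root_resolvent(z, k):
    """X(z) = -1 / (z + k Y(z)), Y being the root in C_+ of (k-1) Y^2 + z Y + 1 = 0."""
    disc = cmath.sqrt(z * z - 4.0 * (k - 1))
    roots = [(-z + disc) / (2.0 * (k - 1)), (-z - disc) / (2.0 * (k - 1))]
    y = max(roots, key=lambda w: w.imag)
    return -1.0 / (z + k * y)


def kesten_mckay_density(x, k):
    """Density f(x) of the limiting spectral measure for alpha = 0."""
    edge = 2.0 * math.sqrt(k - 1)
    if abs(x) >= edge:
        return 0.0
    return k * math.sqrt(4.0 * (k - 1) - x * x) / (2.0 * math.pi * (k * k - x * x))


def total_mass(k, steps=20000):
    """Midpoint-rule integral of the density over its support."""
    edge = 2.0 * math.sqrt(k - 1)
    h = 2.0 * edge / steps
    return h * sum(kesten_mckay_density(-edge + (i + 0.5) * h, k) for i in range(steps))
```

Evaluating $\Im X(x + iv)/\pi$ at a very small $v$ recovers $f(x)$ inside the support, which is the inversion formula used in the example.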

Example 2 If $G_n$ is an Erdős-Rényi graph on $[n]$ with parameter $p/n$, then $G_n$ converges to
the GWT with degree distribution $\mathrm{Poi}(p)$. In this case, $F$ and $F_*$ have the same distribution,
thus for $\alpha = 0$, $X \stackrel{d}{=} Y$ has law $Q$. Theorem 2 improves a result of Khorunzhy, Shcherbina and
Vengerovsky [23], Theorems 3 and 4, Equations (2.17), (2.24). Indeed, for all $z \in \mathbb{C}_+$, we use
the formula

$$ e^{iuw} = 1 - \sqrt{u} \int_0^\infty \frac{J_1(2\sqrt{ut})}{\sqrt{t}} \, e^{-it w^{-1}} \, dt, $$

valid for all $u \ge 0$ and $w \in \mathbb{C}_+$, where $J_1(t) = \frac{t}{2} \sum_{k=0}^{\infty} \frac{(-t^2/4)^k}{k!(k + 1)!}$ is the Bessel function of the
first kind. Then with $f(u, z) = \mathbb{E} e^{iuY(z)}$ and $\varphi(z) = \mathbb{E} z^N$ we obtain easily from (2) that for all
$u \ge 0$, $z \in \mathbb{C}_+$,

$$ f(u, z) = 1 - \sqrt{u} \int_0^\infty \frac{J_1(2\sqrt{ut})}{\sqrt{t}} \, e^{it(z + \alpha)} \, \varphi\big(e^{it\alpha} f(t, z)\big) \, dt. $$

If $N$ is a Poisson random variable with parameter $p$, then $\varphi(z) = \exp(p(z - 1))$, and we obtain
the results in [23].
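For the Erdős-Rényi case with $\alpha = 0$, the RDE (2) reads $Y \stackrel{d}{=} -(z + \sum_{i=1}^N Y_i)^{-1}$ with $N \sim \mathrm{Poi}(p)$. A common heuristic for approximating the solution of such an equation is a "population dynamics" (pool resampling) scheme. The paper does not use this method; it is swapped in here purely as a numerical illustration, and all names and parameters (`pool_size`, `sweeps`, `seed`) are assumptions.

```python
import math
import random


def sample_poisson(rng, lam):
    """Knuth's multiplication method; adequate for small lam."""
    threshold = math.exp(-lam)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def population_dynamics(z, lam, pool_size=2000, sweeps=20, seed=0):
    """Approximate samples of Y(z) solving Y = -1 / (z + sum_{i<=N} Y_i), N ~ Poi(lam)."""
    rng = random.Random(seed)
    pool = [-1.0 / z] * pool_size  # the N = 0 value, which lies in C_+
    for _ in range(sweeps * pool_size):
        n = sample_poisson(rng, lam)
        total = sum(pool[rng.randrange(pool_size)] for _ in range(n))
        pool[rng.randrange(pool_size)] = -1.0 / (z + total)
    return pool
```

Every update keeps the pool inside $\mathcal{H}$-type bounds: each entry has positive imaginary part and modulus at most $1/\Im z$. Since $X \stackrel{d}{=} Y$ in this example, the pool average is a rough estimate of $\mathbb{E} X(z)$, the limit of $m_n(z)$ in Theorem 2 (ii).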
3 Proof of Theorem 1



3.1 Random finite networks 

It will be convenient to work with marked graphs that we call networks. First note that the space
$\mathcal{H}$ of holomorphic functions $f$ from $\mathbb{C}_+$ to $\mathbb{C}_+$ such that $|f(z)| \le \frac{1}{\Im z}$, equipped with the topology
induced by the uniform convergence on compact sets, is a complete separable metrizable compact
space (see Chapter 7 in [14]). We now define a network as a graph $G = (V, E)$ together with a
complete separable metric space, in our case $\mathcal{H}$, and a map from $V$ to $\mathcal{H}$. We use the following
notation: $G$ will denote a graph and $\overline{G}$ a network with underlying graph $G$. A rooted network
$(\overline{G}, o)$ is the connected component of a network $\overline{G}$ of a distinguished vertex $o$ of $G$, called the
root. $[\overline{G}, o]$ will denote the class of rooted networks that are rooted-isomorphic to $(\overline{G}, o)$. Let
$\overline{\mathcal{G}}^*$ denote the set of rooted isomorphism classes of rooted connected locally finite networks. As
in [1], define a metric on $\overline{\mathcal{G}}^*$ by letting the distance between $(\overline{G}_1, o_1)$ and $(\overline{G}_2, o_2)$ be $1/(a + 1)$,
where $a$ is the supremum of those $r \in \mathbb{N}$ such that there is some rooted isomorphism of the balls
of (graph-distance) radius $r$ around the roots of $G_i$ such that each pair of corresponding marks
has distance less than $1/r$. Note that the metric defined on $\mathcal{G}^*$ in Section 2.1 corresponds to
the case of a constant mark attached to each vertex. It is now easy to extend the local weak
convergence of graphs to networks. To fix notation, for a finite network $\overline{G}$, let $U(\overline{G})$ denote the
distribution on $\overline{\mathcal{G}}^*$ obtained by choosing a uniform random vertex of $G$ as root. If $(\overline{G}_n)$, $n \in \mathbb{N}$, is
a sequence of (possibly random) networks with vertex set $[n]$, we denote by $\mathbb{E}_n[\cdot]$ the expectation
with respect to the randomness of the graph $G_n$. We will say that the random weak limit of $\overline{G}_n$
is $\overline{\rho}$ if $\mathbb{E}_n[U(\overline{G}_n)] \Rightarrow \overline{\rho}$.

In this paper, we consider the finite networks $\{G_n, (R_n(z)_{ii})_{i \in [n]}\}$, where we attach the mark
$R_n(z)_{ii}$ to vertex $i$. We need to check that the map $z \mapsto R_n(z)_{ii}$ belongs to $\mathcal{H}$. First note that
by standard linear algebra (see Lemma 7.2 in the Appendix), we have

$$ R_n(z)_{ii} = \Big( (\Delta_n)_{ii} - z - \beta_i^t (\Delta_{n,i} - z I_{n-1})^{-1} \beta_i \Big)^{-1}, $$

where $\beta_i$ is the $((n - 1) \times 1)$ $i$th column vector of $\Delta_n$ with the $i$th element removed and $\Delta_{n,i}$ is
the matrix obtained from $\Delta_n$ with the $i$th row and column deleted (corresponding to the graph
with vertex $i$ deleted). An easy induction on $n$ shows that $R_n(z)_{ii} \in \mathcal{H}$ for all $i \in [n]$.
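The Schur complement identity for the diagonal of the resolvent can be verified on a small matrix. The sketch below is an illustration only: it uses a plain Gauss-Jordan inverse written for this example (the names `invert`, `resolvent` and `schur_diagonal` are hypothetical), assumes the matrix has size at least 2, and is not meant for large or ill-conditioned inputs.

```python
def invert(matrix):
    """Gauss-Jordan inverse with partial pivoting for a small complex matrix."""
    n = len(matrix)
    aug = [list(map(complex, row)) + [complex(i == j) for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [x / scale for x in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def resolvent(delta, z):
    """R(z) = (Delta - z I)^{-1}."""
    n = len(delta)
    return invert([[delta[i][j] - (z if i == j else 0) for j in range(n)]
                   for i in range(n)])


def schur_diagonal(delta, z, i):
    """R(z)_ii = (Delta_ii - z - beta^t (Delta_minor - z I)^{-1} beta)^{-1}."""
    keep = [j for j in range(len(delta)) if j != i]
    beta = [delta[j][i] for j in keep]
    inner = resolvent([[delta[r][c] for c in keep] for r in keep], z)
    quad = sum(beta[a] * inner[a][b] * beta[b]
               for a in range(len(keep)) for b in range(len(keep)))
    return 1.0 / (delta[i][i] - z - quad)
```

On a real symmetric matrix and $z \in \mathbb{C}_+$, both computations agree, and each diagonal entry has positive imaginary part and modulus at most $1/\Im z$, as required for membership in $\mathcal{H}$.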

3.2 Linear operators associated with a graph of bounded degree 

We first recall some standard results that can be found in Mohar and Woess [29]. Let $(G, o)$ be
the connected component of a locally finite graph $G$ containing the vertex $o$ of $G$. There is no
loss of generality in assuming that the vertex set of $G$ is $\mathbb{N}$. Indeed if $G$ is finite, we can extend
$G$ by adding isolated vertices. We assume that $\deg(G) = \sup\{\deg(G, u), u \in \mathbb{N}\} < \infty$. Let $A(G)$
be the adjacency matrix of $G$. We define the matrix $\Delta = \Delta(G) = A(G) - \alpha D(G)$, where $D(G)$
is the degree diagonal matrix and $\alpha \in \{0, 1\}$. Let $\{e_i : i \in \mathbb{N}\}$ be the specified complete
orthonormal system of $L^2(\mathbb{N})$. Then $\Delta$ can be interpreted as a linear operator over $L^2(\mathbb{N})$, which
is defined on the basis vector $e_k$ as follows:

$$ (\Delta e_k, e_j) = \Delta_{kj}. $$

Since $G$ is locally finite, $\Delta e_k$ is an element of $L^2(\mathbb{N})$ and $\Delta$ can be extended by linearity to a
dense subspace of $L^2(\mathbb{N})$, which is spanned by the basis vectors $\{e_k, k \in \mathbb{N}\}$. Denote this dense
subspace $H_0$ and the corresponding operator $\Delta_0$. The operator $\Delta_0$ is symmetric on $H_0$ and thus
closable (Section VIII.2 in [31]). We will denote the closure of $\Delta_0$ by the same symbol $\Delta$ as the
matrix. The operator $\Delta$ is by definition a closed symmetric transformation: the coordinates of
$y = \Delta x$ are

$$ y_i = \sum_j \Delta_{ij} x_j, $$

whenever these series converge.

The following lemma is proved in [28] for the case $\alpha = 0$, and the case $\alpha = 1$ follows by the
same argument.

Lemma 2.1 Assume that $\deg(G) = \sup\{\deg(G, u), u \in V\} < \infty$, then $\Delta$ is self-adjoint.

3.3 Proof of Theorem 1 (i) - bounded degree

In this paragraph, we assume that there exists $\ell \in \mathbb{N}$ such that

$$ \rho(\deg(G, o) \le \ell) = 1. \tag{6} $$

Since $\mathcal{G}^*$ is a complete separable metric space, by the Skorokhod Representation Theorem
(Theorem 7 in the Appendix), we can assume that $U(G_n)$ and $(G, o)$ are defined on a common
probability space such that $U(G_n)$ converges almost surely to $(G, o)$ in $\mathcal{G}^*$. As explained in the
previous section, we define the operators $\Delta = \Delta(G)$ and $\Delta_n = \Delta(G_n)$ on the Hilbert space
$L^2(\mathbb{N})$. The convergence of $U(G_n)$ to $(G, o)$ implies a convergence of the associated operators up
to a re-indexing of $\mathbb{N}$ which preserves the root. To be more precise, let $(G, o)[r]$ denote the finite
graph induced by the vertices at distance at most $r$ from the root $o$ in $G$. Then by definition
there exist a.s. a sequence $r_n$ tending to infinity and a random rooted isomorphism $\sigma_n$ from
$(G, o)[r_n]$ to $U(G_n)[r_n]$. We extend arbitrarily this isomorphism $\sigma_n$ to all $\mathbb{N}$. For the basis vector
$e_k \in L^2(\mathbb{N})$, we set $\sigma_n(e_k) = e_{\sigma_n(k)}$. For simplicity, we assume that $e_0$ corresponds to the root of
the graph (so that $\sigma_n(0) = 0$) and we denote $e_0 = o$ to be consistent with the notation used for
graphs. By extension, we define $\sigma_n(\phi)$ for each $\phi \in H_0$, the subspace of $L^2(\mathbb{N})$ which is spanned
by the basis vectors $\{e_k, k \in \mathbb{N}\}$. Then the convergence of $U(G_n)$ to $(G, o)$ implies that for all
$\phi \in H_0$,

$$ \sigma_n^{-1} \Delta_n \sigma_n \phi \to \Delta \phi \quad \text{a.s.} \tag{7} $$

By Theorem VIII.25(a) in [31], the convergence (7) and the fact that $\Delta$ is a self-adjoint operator
(due to (6)) imply the convergence of $\sigma_n^{-1} \Delta_n \sigma_n \to \Delta$ in the strong resolvent sense:

$$ \sigma_n^{-1} R_n(z) \sigma_n x - R(z) x \to 0, \quad \text{for any } x \in L^2(\mathbb{N}) \text{ and for all } z \in \mathbb{C}_+. $$

This last statement shows that the sequence of networks $U(\overline{G}_n)$ converges a.s. to $(\overline{G}, o)$ in $\overline{\mathcal{G}}^*$.
In particular, we have

$$ m_n(z) = \frac{1}{n} \sum_{i=1}^{n} (R_n(z) e_i, e_i) = U(G_n)\big((R_n(z) o, o)\big) \to \int (R(z) o, o) \, d\rho[G, o], \tag{8} $$

by dominated convergence since $|(R_n(z) e_i, e_i)| \le (\Im z)^{-1}$.

Remark 1 A trace operator Tr was defined in [1]. With their notation, we have

$$ \mathrm{Tr}(R(z)) = \int (R(z) o, o) \, d\rho[G, o]. $$



3.4 Proof of Theorem 1 (i) - general case

Let $\ell \in \mathbb{N}$. We define the graph $G_{n,\ell}$ on $[n]$ obtained from $G_n$ by removing all edges adjacent to
a vertex $i$ whenever $\deg(G_n, i) > \ell$. Therefore the matrix $\Delta(G_{n,\ell})$, denoted by $\Delta_{n,\ell}$, is equal to, for $i \ne j$,

$$ (\Delta_{n,\ell})_{ij} = \begin{cases} A(G_n)_{ij} & \text{if } \max\{\deg(G_n, i), \deg(G_n, j)\} \le \ell, \\ 0 & \text{otherwise,} \end{cases} $$

and $(\Delta_{n,\ell})_{ii} = -\alpha \sum_{j \ne i} (\Delta_{n,\ell})_{ij}$. The empirical measure of the eigenvalues of $\Delta_{n,\ell}$ is denoted by
$\mu_{\Delta_{n,\ell}}$. By the Rank Difference Inequality (Lemma 7.1 in the Appendix),

$$ L(\mu_{\Delta_n}, \mu_{\Delta_{n,\ell}}) \le \frac{1}{n} \mathrm{rank}(\Delta_n - \Delta_{n,\ell}), $$

where $L$ denotes the Lévy distance. The rank of $\Delta_n - \Delta_{n,\ell}$ is upper bounded by the number of
rows different from 0, i.e. by the number of vertices with degree at least $\ell + 1$ or such that there
exists a neighboring vertex with degree at least $\ell + 1$. By definition, each vertex with degree $d$ is
connected to $d$ other vertices. It follows

$$ \mathrm{rank}(\Delta_n - \Delta_{n,\ell}) \le \sum_{i=1}^{n} (\deg(G_n, i) + 1) \, 1(\deg(G_n, i) > \ell), $$

and therefore:

$$ L(\mu_{\Delta_n}, \mu_{\Delta_{n,\ell}}) \le \int (\deg(G, o) + 1) \, 1(\deg(G, o) > \ell) \, dU(G_n)[G, o] =: p_{n,\ell}. $$

By assumptions (D-A), uniformly in $\ell$,

$$ p_{n,\ell} \to \int (\deg(G, o) + 1) \, 1(\deg(G, o) > \ell) \, d\rho[G, o], $$

where the right-hand term tends to 0 as $\ell$ tends to infinity. Fix $\epsilon > 0$; for $\ell$ sufficiently large
and $n \ge N(\ell)$, we have

$$ L(\mu_{\Delta_n}, \mu_{\Delta_{n,\ell}}) \le p_{n,\ell} \le \epsilon. $$

Hence we get, for $n \ge N(\ell)$, $q \in \mathbb{N}$,

$$ L(\mu_{\Delta_{n+q}}, \mu_{\Delta_n}) \le 2\epsilon + L(\mu_{\Delta_{n+q,\ell}}, \mu_{\Delta_{n,\ell}}). $$

By (8), if $m_{n,\ell}(z)$ denotes the Stieltjes transform of $\mu_{\Delta_{n,\ell}}$, we have for all $z \in \mathbb{C}_+$,
$\lim_{n \to \infty} m_{n,\ell}(z) = m_\ell(z)$ for some $m_\ell \in \mathcal{H}$. Hence, for the weak convergence,
$\lim_{n \to \infty} \mu_{\Delta_{n,\ell}} = \mu_\ell$ for the probability measure $\mu_\ell$ whose Stieltjes transform is $m_\ell$.
It follows that the sequence $\mu_{\Delta_n}$ is Cauchy and the
proof of Theorem 1 (i) is complete (recall that the set of probability measures on $\mathbb{R}$ with the Lévy
metric is complete).
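The truncation step can be illustrated directly on an edge list. In this sketch (the name `truncate` and the returned triple are choices made for the example), edges touching a vertex of degree larger than $\ell$ are removed, and the number of vertices that lose at least one edge, which bounds the number of non-zero rows of the difference of the two matrices, is compared with the bound $\sum_i (\deg(G_n, i) + 1) 1(\deg(G_n, i) > \ell)$ from the text.

```python
def truncate(n, edges, ell):
    """Remove every edge adjacent to a vertex of degree > ell.

    Returns (kept_edges, number_of_changed_rows, rank_bound), where the bound is
    sum_i (deg(i) + 1) * 1(deg(i) > ell).
    """
    degree = [0] * n
    for i, j in edges:
        degree[i] += 1
        degree[j] += 1
    kept = [(i, j) for i, j in edges if max(degree[i], degree[j]) <= ell]
    removed = [(i, j) for i, j in edges if max(degree[i], degree[j]) > ell]
    touched = {v for edge in removed for v in edge}  # rows that actually change
    bound = sum(d + 1 for d in degree if d > ell)
    return kept, len(touched), bound
```

A row of the difference can be non-zero only if the corresponding vertex lost an edge, so the second returned value never exceeds the third.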



3.5 Proof of Theorem 1 (ii) - bounded degree

We assume first the following

A''. There exists $\ell \ge 1$ such that for all $n \in \mathbb{N}$ and $i \in [n]$, $\deg(G_n, i) \le \ell$.

With this extra assumption, $\Delta_n$ and $\Delta$ are self-adjoint operators and, as in Section 3.3, we
deduce that $\mathbb{E} U(\overline{G}_n) \Rightarrow \overline{\rho}$. In particular, we have

$$ \mathbb{E}[m_n(z)] \to \int (R(z) o, o) \, d\rho[G, o]. $$

Hence, in order to prove Theorem 1 (ii) with the additional assumption (A''), it is sufficient
to prove that, for all $z \in \mathbb{C}_+$,

$$ \lim_{n \to \infty} \mathbb{E} \, |m_n(z) - \mathbb{E}[m_n(z)]|^2 = 0. \tag{9} $$

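Take $z \in \mathbb{C}_+$ with $\Im z > 2\ell$. Notice that, by Vitali's convergence theorem, it is sufficient to
prove (9) for all $z$ such that $\Im z > 2\ell$. By assumption (A''), we get $|(\Delta_n^k)_{ii}| \le \ell^k$, and we may
thus write for any integer $t \ge 1$,

$$ R_n(z)_{ii} = -\sum_{k=0}^{t-1} \frac{(\Delta_n^k)_{ii}}{z^{k+1}} - \sum_{k=t}^{\infty} \frac{(\Delta_n^k)_{ii}}{z^{k+1}} =: X_i^{(n)}(t) + \epsilon_i^{(n)}(t). $$

Note that $|\epsilon_i^{(n)}(t)| \le \epsilon(t) := \sum_{k=t}^{\infty} \frac{\ell^k}{(2\ell)^{k+1}} \le 2^{-t}$. We define
$\bar{X}_i^{(n)}(t) = X_i^{(n)}(t) - \mathbb{E} X_i^{(n)}(t)$. Since $|\epsilon_i^{(n)}(t)| \le \epsilon(t)$, we have

$$ |m_n(z) - \mathbb{E}[m_n(z)]| \le \Big| \frac{1}{n} \sum_{i=1}^{n} \bar{X}_i^{(n)}(t) \Big| + 2\epsilon(t). $$

Therefore if for all $t \ge 1$, we manage to prove that in $L^2(P)$

$$ \lim_{n \to \infty} \frac{1}{n} \sum_{i=1}^{n} \bar{X}_i^{(n)}(t) = 0, \tag{10} $$

then the proof of (9) will be complete. We now prove (10):

$$ \mathbb{E} \Big( \frac{1}{n} \sum_{i=1}^{n} \bar{X}_i^{(n)}(t) \Big)^2 = \frac{1}{n^2} \mathbb{E} \Big( \sum_{i \ne j} \bar{X}_i^{(n)}(t) \bar{X}_j^{(n)}(t) + \sum_{i=1}^{n} \bar{X}_i^{(n)}(t)^2 \Big) = \mathbb{E} \big( \bar{X}_{o_1}^{(n)}(t) \bar{X}_{o_2}^{(n)}(t) \big), $$

where $(o_1, o_2)$ is a uniform pair of vertices. We then notice that $X_i^{(n)}(t)$ is a measurable
function of the ball of radius $t$ and center $i$. Thus, by assumption (R),
$\lim_n \mathbb{E} \bar{X}_{o_1}^{(n)}(t) \bar{X}_{o_2}^{(n)}(t) = \lim_n \mathbb{E} \bar{X}_{o_1}^{(n)}(t) \, \mathbb{E} \bar{X}_{o_2}^{(n)}(t) = 0$,
and (10) follows. Hence we proved (9) under the assumption (A'').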
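The locality used in this argument can be seen concretely in the case $\alpha = 0$: $(A^k)_{ii}$ counts closed walks of length $k$ from $i$, which only visit vertices close to $i$. The sketch below uses illustrative names and adjacency lists stored in a dictionary. It computes these counts by propagating a sparse vector and forms the truncated series $-\sum_{k<t} (A^k)_{ii} / z^{k+1}$. Two cycles of different lengths have isomorphic balls of small radius, so they produce identical truncated values.

```python
def closed_walk_counts(adjacency, root, t):
    """Return [(A^k)_{root, root} for k = 0, ..., t - 1] by propagating a sparse vector."""
    vec = {root: 1}
    counts = []
    for _ in range(t):
        counts.append(vec.get(root, 0))
        nxt = {}
        for v, weight in vec.items():
            for u in adjacency[v]:
                nxt[u] = nxt.get(u, 0) + weight
        vec = nxt
    return counts


def cycle(n):
    """Adjacency lists of the cycle on {0, ..., n - 1}."""
    return {v: [(v - 1) % n, (v + 1) % n] for v in range(n)}


def truncated_resolvent(counts, z):
    """-sum_{k < t} (A^k)_ii / z^(k + 1), the local part of the resolvent expansion."""
    return -sum(c / z ** (k + 1) for k, c in enumerate(counts))
```

With $t = 6$, the walk counts on cycles of length 10 and 12 coincide with those of the infinite path, since no closed walk of length at most 5 can go around either cycle.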

3.6 Proof of Theorem 1 (ii) - general case

We now relax assumption (A'') to assumption (A). By the same argument as in Section 3.4 we
get

$$ \mathbb{E} L(\mu_{\Delta_n}, \mu_{\Delta_{n,\ell}}) \le \mathbb{E} \int (\deg(G, o) + 1) \, 1(\deg(G, o) > \ell) \, dU(G_n)[G, o] =: p_{n,\ell}. $$

By assumptions (R-A), uniformly in $\ell$,

$$ p_{n,\ell} \to \int (\deg(G, o) + 1) \, 1(\deg(G, o) > \ell) \, d\rho[G, o], $$

where the right-hand term tends to 0 as $\ell$ tends to infinity. The end of the proof follows by the
same argument, since we have now for $n \ge N(\ell)$, $q \ge 1$,

$$ \mathbb{E} L(\mu_{\Delta_{n+q}}, \mu_{\Delta_n}) \le 2\epsilon + \mathbb{E} L(\mu_{\Delta_{n+q,\ell}}, \mu_{\Delta_{n,\ell}}). $$



4 Proof of Theorem 2

4.1 Proof of Theorem 2 (i)

In this paragraph, we check the existence and the uniqueness of the solution of the RDE (2). Let
$\Theta = \mathbb{N} \times \mathcal{H}^\infty$, where $\mathcal{H}^\infty$ is the usual infinite product space. We define a map $\psi : \Theta \to \mathcal{H}$ as
follows

$$ \psi(n, (h_i)_{i \in \mathbb{N}}) : \mathbb{C}_+ \to \mathbb{C}_+, \quad z \mapsto -\Big( z + \alpha(n + 1) + \sum_{i=1}^{n} h_i(z) \Big)^{-1}. $$

Let $\Psi$ be a map from $\mathcal{P}(\mathcal{H})$ to itself, where $\Psi(P)$ is the distribution of $\psi(N, (Y_i)_{i \in \mathbb{N}})$, where

(i) $(Y_i, i \ge 1)$ are independent with distribution $P$;

(ii) $N$ has distribution $F$;

(iii) the families in (i) and (ii) are independent.

We say $Q \in \mathcal{P}(\mathcal{H})$ is a solution of the RDE (2) if $Q = \Psi(Q)$.

Lemma 2.2 There exists a unique measure $Q \in \mathcal{P}(\mathcal{H})$ solution of the RDE (2).

Proof. Let $\Omega$ be a bounded open set in the half plane $\{z \in \mathbb{C}_+ : \Im z \ge \sqrt{\mathbb{E} N} + 1\}$ (by
assumption (A'), $\mathbb{E} N$ is finite). Let $\mathcal{P}(\mathcal{H})$ be the set of probability measures on $\mathcal{H}$. We define
the distance on $\mathcal{P}(\mathcal{H})$

$$ W(P, Q) = \inf \mathbb{E} \int_\Omega |X(z) - Y(z)| \, dz, $$

where the infimum is over all possible couplings of the distributions $P$ and $Q$, where $X$ has law $P$
and $Y$ has law $Q$. The fact that $W$ is a distance follows from the fact that two holomorphic
functions equal on a set containing a limit point are equal. The space $\mathcal{P}(\mathcal{H})$ equipped with the
metric $W$ is a complete metric space.

Let $X$ with law $P$ and $Y$ with law $Q$ be coupled so that $W(P, Q) = \mathbb{E} \int_\Omega |X(z) - Y(z)| \, dz$. We
consider $(X_i, Y_i)_{i \in \mathbb{N}}$, iid copies of $(X, Y)$, independent of the variable $N$. By definition, we
have the following

$$ \begin{aligned}
W(\Psi(P), \Psi(Q)) &\le \mathbb{E} \int_\Omega |\psi(N, (X_i); z) - \psi(N, (Y_i); z)| \, dz \\
&\le \mathbb{E} \int_\Omega \left| \frac{\sum_{i=1}^{N} (X_i(z) - Y_i(z))}{\big( z + \alpha(N + 1) + \sum_{i=1}^{N} X_i(z) \big) \big( z + \alpha(N + 1) + \sum_{i=1}^{N} Y_i(z) \big)} \right| dz \\
&\le \int_\Omega (\Im z)^{-2} \, \mathbb{E} \sum_{i=1}^{N} |X_i(z) - Y_i(z)| \, dz \\
&\le \mathbb{E} N \, \big( \inf_{z \in \Omega} \Im z \big)^{-2} \, W(P, Q).
\end{aligned} $$

Then since $\inf_{z \in \Omega} \Im z > \sqrt{\mathbb{E} N}$, $\Psi$ is a contraction and, from the Banach fixed point theorem, there
exists a unique probability measure $Q$ on $\mathcal{H}$ such that $\Psi(Q) = Q$. □

4.2 Resolvent of a tree

In this paragraph, we prove the following proposition.

Proposition 2.1 Let $F_*$ be a distribution with finite mean, and $(T_n, 1)$ be a GWT rooted at 1
with degree distribution $F_*$ stopped at generation $n$. Let $A(T_n)$ be the adjacency matrix of $T_n$,
and $\Delta(T_n) = A(T_n) - \alpha D(T_n)$. Let $R^{(n),T}(z) = (\Delta(T_n) - zI)^{-1}$ and $X^{(n),T}(z) = R^{(n),T}_{11}(z)$. For
all $z \in \mathbb{C}_+$, as $n$ goes to infinity $X^{(n),T}(z)$ converges weakly to $X(z)$ defined by Equation (3).



We start with the following lemma, which explains where the RDE (2) comes from.

Lemma 2.3 Let $F$ be a distribution with finite mean, and $(T_n, 1)$ be a GWT rooted at 1 with
offspring distribution $F$ stopped at generation $n$. Let $A(T_n)$ be the adjacency matrix of $T_n$, and
$\Delta(T_n) = A(T_n) - \alpha(D(T_n) + V(T_n))$, where $V(T_n)_{11} = 1$ and $V(T_n)_{ij} = 0$ for all $(i, j) \ne (1, 1)$.
Let $R^{(n),T}(z) = (\Delta(T_n) - zI)^{-1}$ and $Y^{(n),T}(z) = R^{(n),T}_{11}(z)$. For all $z \in \mathbb{C}_+$, as $n$ goes to infinity
$Y^{(n),T}(z)$ converges weakly to $Y(z)$ given by the RDE (2).



Proof. For simplicity, we omit the superscript T and the variable z. We order the vertices 
of (T n , 1) according to a depth-first search in the tree. We denote by N = the number 
of offsprings of the root and by T^, • • • , the subtrees of T n \{l} ordered in the order of the 
depth-first search. With this ordering, we obtain a matrix A(T n ) of the following shape: 



/o 

1 



1 





1 



V 



1 



1 



A(T n 2 ) 



1 -A 



/ 



We then use a classical of Schur decomposition formula for A(T n ) — zL (see Lemma 17.21 in 
Appendix) 



y(n) 



+ a(£>(T n )n + 1) + £ R% X) A{T n ) u A{T n ) Xi 

2<i,J<n 



(11) 



where i?( n_1 ) = (A 1 ™™ 1 ) — zL)~ l with A( n_1 ) is the matrix obtained from A(T n ) with the first 
row and column deleted. It follows that i?( n_1 ) is decomposable in the following diagonal block 
form 

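\[
\widetilde R^{(n-1)} =
\begin{pmatrix}
\bar R(T_n^1) & 0 & \cdots & 0 \\
0 & \bar R(T_n^2) & \cdots & 0 \\
\vdots & & \ddots & \vdots \\
0 & 0 & \cdots & \bar R(T_n^N)
\end{pmatrix},
\]

where $\bar R(T_n^i) = (\bar\Delta(T_n^i) - zI)^{-1}$. In particular, we get

\[
\widetilde R^{(n-1)}_{ij} A(T_n)_{1i} A(T_n)_{1j} = 0 \quad \text{if } i \neq j. \tag{12}
\]

Indeed, if $A(T_n)_{1i} A(T_n)_{1j} = 1$ then both $i$ and $j$ are offspring of 1 in separate subtrees, so that
$\widetilde R^{(n-1)}_{ij} = 0$.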

Since $T_n$ is a Galton-Watson tree of depth $n$, the number of offspring of the root, $N = D(T_n)_{11}$,
has distribution $F$. Moreover the subtrees $T_n^1, \dots, T_n^N$ are iid with common distribution $T_{n-1}$,
and are independent of $N$. We now define

\[
Y_i^{(n-1)} = \big( (\bar\Delta(T_n^i) - zI)^{-1} \big)_{11},
\]

the diagonal entry of $\widetilde R^{(n-1)}$ at the root of $T_n^i$. It follows that $(Y_1^{(n-1)}, \dots, Y_N^{(n-1)})$ are iid, independent of $N$, with the same common law as
$Y^{(n-1)}$. From Equations (11), (12), we deduce

\[
Y^{(n)} = -\Big( z + \alpha(N+1) + \sum_{i=1}^{N} Y_i^{(n-1)} \Big)^{-1}.
\]

In other words, with a slight abuse of notation, and identifying a random variable with its
distribution, we have $Y^{(n)} = \Psi(Y^{(n-1)})$, where the mapping $\Psi$ was defined in §4.1. The end of
the proof follows directly from Lemma 2.2. □
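
The recursion obtained in this proof can be checked numerically on a fixed finite tree. The following is a minimal sketch and not part of the original argument: the helper names (`random_tree`, `root_resolvent_recursive`, `root_resolvent_matrix`) and the uniform offspring law are assumptions chosen for illustration. It compares the root entry of $(A - \alpha(D + V) - zI)^{-1}$ obtained by direct inversion with the value obtained by applying the recursion from the leaves up.

```python
import numpy as np

def random_tree(depth, rng, max_children=3):
    """Children lists of a random rooted tree stopped at generation `depth`.

    Vertex 0 is the root. The offspring law (uniform on {0, ..., max_children})
    is an arbitrary choice made for this illustration only.
    """
    children = [[]]
    level = [0]
    for _ in range(depth):
        next_level = []
        for v in level:
            for _ in range(int(rng.integers(0, max_children + 1))):
                children.append([])
                new_vertex = len(children) - 1
                children[v].append(new_vertex)
                next_level.append(new_vertex)
        level = next_level
    return children

def root_resolvent_recursive(children, z, alpha, v=0):
    """Apply Y(v) = -(z + alpha*(N_v + 1) + sum of Y over the children of v)^(-1)."""
    s = sum(root_resolvent_recursive(children, z, alpha, c) for c in children[v])
    return -1.0 / (z + alpha * (len(children[v]) + 1) + s)

def root_resolvent_matrix(children, z, alpha):
    """Root entry of (A - alpha*(D + V) - z I)^(-1), computed by direct inversion."""
    m = len(children)
    A = np.zeros((m, m))
    for v, cs in enumerate(children):
        for c in cs:
            A[v, c] = A[c, v] = 1.0
    D = np.diag(A.sum(axis=1))          # degree matrix of the whole tree
    V = np.zeros((m, m))
    V[0, 0] = 1.0                       # extra unit at the root only
    delta_bar = A - alpha * (D + V)
    return np.linalg.inv(delta_bar - z * np.eye(m))[0, 0]
```

Every non-root vertex has one parent, so its degree is its number of children plus one, which is why the same term `alpha * (N_v + 1)` appears at each level of the recursion.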

Proof of Proposition 2.1. Again, we omit the superscript $T$ and the variable $z$. As above, we
use the decomposition formula:

\[
X^{(n)} = -\Big( z + \alpha D(T_n)_{11} + \sum_{2 \le i,j \le n} \widetilde R^{(n-1)}_{ij} A(T_n)_{1i} A(T_n)_{1j} \Big)^{-1},
\]

where $\widetilde R^{(n-1)} = (\widetilde\Delta^{(n-1)} - zI)^{-1}$ and $\widetilde\Delta^{(n-1)}$ is the matrix obtained from $\Delta(T_n)$ with the first
row and column deleted. As above, since $T_n$ is a tree, $\widetilde R^{(n-1)}_{ij} A(T_n)_{1i} A(T_n)_{1j} = 0$ if $i \neq j$, so that
we get

\[
X^{(n)} = -\Big( z + \alpha N_* + \sum_{i=1}^{N_*} Y_i^{(n-1)} \Big)^{-1},
\]

where $N_*$ has distribution $F_*$ and $Y_i^{(n-1)}$ are iid copies of $Y^{(n-1)}$, independent of $N_*$, defined in
Lemma 2.3. Proposition 2.1 follows easily from Lemma 2.3. □
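
As an illustration of the two recursions, consider the deterministic case $\alpha = 0$, $F_* = \delta_d$ and $F = \delta_{d-1}$ (the $d$-regular tree). The sketch below is an assumption-laden example rather than part of the proof: the function names are hypothetical, and the comparison target is the classical Kesten-McKay density $f(x) = d\sqrt{4(d-1) - x^2} / (2\pi(d^2 - x^2))$ on $|x| \le 2\sqrt{d-1}$, integrated numerically against $1/(x - z)$ with a midpoint rule.

```python
import math

def regular_tree_resolvent(d, z, iterations=500):
    """Iterate Y <- -(z + (d-1) Y)^(-1), then return X = -(z + d Y)^(-1).

    The starting value -1/z is the resolvent of a single vertex (depth 0, alpha = 0).
    """
    y = -1.0 / z
    for _ in range(iterations):
        y = -1.0 / (z + (d - 1) * y)
    return -1.0 / (z + d * y)

def kesten_mckay_stieltjes(d, z, points=200000):
    """Midpoint-rule approximation of the integral of f(x) / (x - z) dx."""
    r = 2.0 * math.sqrt(d - 1)          # edge of the support
    h = 2.0 * r / points
    total = 0j
    for k in range(points):
        x = -r + (k + 0.5) * h
        f = d * math.sqrt(r * r - x * x) / (2.0 * math.pi * (d * d - x * x))
        total += f / (x - z) * h
    return total
```

For $z$ with positive imaginary part the iteration is a contraction towards the fixed point with positive imaginary part, so a few hundred steps are ample in this sketch.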

4.3 Proof of Theorem 2 (ii) - bounded degree

We first assume that Assumption (A'') holds. Let $T$ be a GWT with degree distribution $F_*$ and
$T_n$ be the restriction of $T$ to the set of vertices at distance at most $n$ from the root (i.e. $T_n$
is stopped at generation $n$). Let $\Delta = \Delta(T)$ denote the operator associated to $T$, as in Section
3.2. $\Delta$ is self-adjoint and we may define for all $z \in \mathbb{C}_+$, $R(z) = (\Delta - zI)^{-1}$. Then by Theorem
VIII.25(a) in [31], $\Delta(T_n)$ converges to $\Delta$ in the strong resolvent sense. Hence, by Proposition
2.1 we have

\[
\langle R(z) o, o \rangle = X(z).
\]

Now, as in §3.3, $\Delta^{(n)}$ and $\Delta$ are self-adjoint operators and $U(G_n) \Rightarrow \rho$. From Theorem
1 we have

\[
m_n(z) \xrightarrow{L^1} \int \langle R(z) o, o \rangle \, d\rho[G,o].
\]

The proof of Theorem 2 (ii) is complete with the extra assumption (A'').

4.4 Proof of Theorem 2 (ii) - general case

We now relax assumption (A'') by assumption (A). From Theorem 1, it is sufficient to prove that
$\mathbb{E} m_n(z)$ converges to $\mathbb{E}X(z)$, where $X$ is defined by Equation (3). By the same argument
as in §3.4, it is sufficient to prove that, for the weak convergence on $\mathcal{H}$,

\[
\lim_{\ell \to \infty} X^{(\ell)} = X,
\]

where $X^{(\ell)}$ is defined by Equation (3) with a degree distribution $F_*^{(\ell)}$ which converges weakly
to $F_*$ as $\ell$ goes to infinity. This continuity property is established in Lemma 7.3 (in Appendix).
The proof of Theorem 2 is complete.

5 Applications and extensions 
5.1 Weighted graphs 

A weighted graph is a graph $G = (V, E)$ with attached weights on its edges. As in §2.1, we
consider a sequence of graphs $G_n$ on $[n]$. We define the symmetric matrix $W = (w_{ij})_{1 \le i,j \le n}$,
where $(w_{ij})_{1 \le i < j}$ is a sequence of iid real variables, independent of $G_n$, and $w_{ij} = w_{ji}$. Let $\circ$
denote the Hadamard product (for all $i, j$, $(A \circ B)_{ij} = A_{ij} B_{ij}$) and let $T(G_n)$ be the diagonal
matrix whose entry $(i,i)$ is equal to $\sum_j w_{ij} A_{ij}$. We define $\mu_{\Delta_n}$ as the spectral measure of the
matrix $\Delta_n = W \circ A(G_n) - \alpha T(G_n)$. If $o$ denotes the uniformly picked root of $G_n$, we assume

B. The sequence of variables $\big( \sum_{k=1}^{n} A_{ok} |w_{ok}| \big)$, $n \in \mathbb{N}$, is uniformly integrable.
An easy extension of Theorem 1 is

Theorem 3 (i) Let $G_n = ([n], E_n)$ be a sequence of graphs satisfying assumptions (D-B).
Then there exists a probability measure $\nu$ on $\mathbb{R}$ such that a.s. $\lim_{n \to \infty} \mu_{\Delta_n} = \nu$.

(ii) Let $G_n = ([n], E_n)$ be a sequence of random graphs satisfying Assumptions (R-B). Then
there exists a probability measure $\nu$ on $\mathbb{R}$ such that $\lim_{n \to \infty} \mathbb{E} L(\mu_{\Delta_n}, \nu) = 0$.

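The only difference with the proof of Theorem 1 appears in §3.4: for $\ell \in \mathbb{N}$, the matrix $\Delta_{n,\ell}$ is
now equal to, for $i \neq j$,

\[
(\Delta_{n,\ell})_{ij} =
\begin{cases}
A(G_n)_{ij} w_{ij} & \text{if } \max\Big\{ \sum_{k=1}^{n} A(G_n)_{ik} |w_{ik}|, \; \sum_{k=1}^{n} A(G_n)_{jk} |w_{jk}| \Big\} \le \ell, \\
0 & \text{otherwise,}
\end{cases}
\]

and $(\Delta_{n,\ell})_{ii} = -\alpha \sum_{j \neq i} (\Delta_{n,\ell})_{ij}$. The remainder is identical.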
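
The construction of $\Delta_n$ and of its truncation $\Delta_{n,\ell}$ can be sketched as follows. This is only an illustration under stated assumptions: the function names `weighted_operator` and `truncated_operator` are hypothetical, `A` is assumed to be a symmetric 0/1 adjacency matrix with zero diagonal, and `W` a symmetric real weight matrix.

```python
import numpy as np

def weighted_operator(A, W, alpha):
    """Delta_n = W o A - alpha * T, where T is diagonal with T_ii = sum_j w_ij A_ij."""
    WA = W * A                               # Hadamard (entrywise) product
    T = np.diag(WA.sum(axis=1))
    return WA - alpha * T

def truncated_operator(A, W, alpha, ell):
    """Delta_{n,ell}: drop edge (i, j) when the weighted load of i or j exceeds ell."""
    WA = W * A
    load = (A * np.abs(W)).sum(axis=1)       # sum_k A_ik |w_ik| for each vertex i
    keep = np.maximum.outer(load, load) <= ell
    off_diagonal = np.where(keep, WA, 0.0)   # diagonal stays 0 because A_ii = 0
    return off_diagonal - alpha * np.diag(off_diagonal.sum(axis=1))

# Small example with arbitrary random inputs.
rng = np.random.default_rng(3)
n = 8
upper = np.triu(rng.integers(0, 2, size=(n, n)), k=1)
A = (upper + upper.T).astype(float)
w = np.triu(rng.standard_normal((n, n)), k=1)
W = w + w.T
Delta = weighted_operator(A, W, alpha=1.0)
```

With $\alpha = 1$ each row of the resulting matrix sums to zero, and with $\ell = \infty$ the truncation leaves the matrix unchanged.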

We may also state an analog of Theorem 2 for the case $\alpha = 0$, that is $\Delta_n = W \circ A(G_n)$. We
denote by $s_n$ the Stieltjes transform of $\mu_{\Delta_n}$. Assumption (A') is strengthened into

B'. Assumption (B) holds, $\sum_k k^2 F_*(k) < \infty$ and $\mathbb{E}[w_{12}^2] < \infty$.

The proof of the next result is a straightforward extension of the proof of Theorem 2.

Theorem 4 Assume that assumptions (RT-B') hold and $\alpha = 0$. Then

(i) There exists a unique probability measure $P \in \mathcal{P}(\mathcal{H})$ such that for all $z \in \mathbb{C}_+$,

\[
Y(z) \stackrel{d}{=} -\Big( z + \sum_{i=1}^{N} |w_i|^2 Y_i(z) \Big)^{-1},
\]

where $N$ has distribution $F$, $w_i$ are iid copies with distribution $w_{12}$, $Y$ and $Y_i$ are iid copies
with law $P$ and the variables $N, w_i, Y_i$ are independent.

(ii) For all $z \in \mathbb{C}_+$, $s_n(z)$ converges as $n$ tends to infinity in $L^1$ to $\mathbb{E}X(z)$, where for all $z \in \mathbb{C}_+$,

\[
X(z) \stackrel{d}{=} -\Big( z + \sum_{i=1}^{N_*} |w_i|^2 Y_i(z) \Big)^{-1},
\]

where $N_*$ has distribution $F_*$, $w_i$ are iid copies with distribution $w_{12}$, $Y_i$ are iid copies with
law $P$ and the variables $N_*, w_i, Y_i$ are independent.

The case $\alpha = 1$ is more complicated: the diagonal term $T(G_n)$ introduces a dependence
within the matrix which breaks the nice recursive structure of the RDE.

5.2 Bipartite graphs



In §2.2, we have considered a sequence of random graphs converging weakly to a GWT.
Another important class of random graphs are the bipartite graphs. A graph $G = (V, E)$ is
bipartite if there exist two disjoint subsets $V^a, V^b$, with $V^a \cup V^b = V$, such that all edges in $E$
have one adjacent vertex in $V^a$ and the other in $V^b$. In particular, if $n$ and $p$ are the cardinals of
$V^a$ and $V^b$, the adjacency matrix of a bipartite graph may be written as

\[
\begin{pmatrix} 0 & M \\ M^* & 0 \end{pmatrix}
\]

for some $n \times p$ matrix $M$. The analysis of random bipartite graphs finds strong motivation in coding
theory, see for example Richardson and Urbanke [32]. Note also that if $M$ is an $n \times p$ matrix,
the spectrum of $M^* M$ may be obtained from the spectrum of $\begin{pmatrix} 0 & M \\ M^* & 0 \end{pmatrix}$. We may thus find
another motivation in sparse statistical problems, see El Karoui [16].
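
The link between the two spectra can be made concrete: the eigenvalues of the block matrix are the singular values of $M$ with both signs, together with $|n - p|$ zeros, so the $p$ largest eigenvalues, squared, are the eigenvalues of $M^* M$. The following sketch checks this numerically; the function names and the random 0/1 matrix are assumptions made for the example only.

```python
import numpy as np

def bipartite_adjacency(M):
    """Block matrix [[0, M], [M*, 0]] associated with an n x p matrix M."""
    n, p = M.shape
    return np.block([[np.zeros((n, n)), M],
                     [M.conj().T, np.zeros((p, p))]])

def gram_spectrum_from_bipartite(M):
    """Eigenvalues of M* M recovered from the spectrum of the block matrix."""
    p = M.shape[1]
    eigenvalues = np.sort(np.linalg.eigvalsh(bipartite_adjacency(M)))
    return eigenvalues[-p:] ** 2         # the p largest eigenvalues, squared

rng = np.random.default_rng(1)
M = rng.integers(0, 2, size=(5, 3)).astype(float)
recovered = gram_spectrum_from_bipartite(M)
direct = np.sort(np.linalg.eigvalsh(M.T @ M))
```

The two arrays `recovered` and `direct` agree up to floating-point error, whether $n \ge p$ or $n < p$.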

The natural limit for random bipartite graphs is the following Bipartite Galton-Watson Tree
(BGWT) with degree distribution $(F_*, G_*)$ and scale $p \in (0, 1)$. The BGWT is obtained from a
Galton-Watson branching process with alternated degree distribution. With probability $p$, the
root has offspring distribution $F_*$, all odd generation genitors have an offspring distribution $G$,
and all even generation genitors (apart from the root) have an offspring distribution $F$. With
probability $1 - p$, the root has offspring distribution $G_*$, all odd generation genitors have an
offspring distribution $F$, and all even generation genitors (apart from the root) have an offspring distribution $G$.

We now consider a sequence $(G_n)$ of random bipartite graphs satisfying assumptions (R-A')
with weak limit a BGWT with degree distribution $(F_*, G_*)$ and scale $p \in (0, 1)$. The weak
convergence of a natural ensemble of bipartite graphs toward a BGWT with degree distribution
$(F_*, G_*)$ and scale $p \in (0, 1)$ follows from [32], $p$ being the proportion of vertices in $V^a$, $F_*$ the
asymptotic degree distribution of vertices in $V^a$ and $G_*$ the asymptotic degree distribution of
vertices in $V^b$. As usual, we denote by $\mu_{\Delta_n}$ the spectral measure of $\Delta_n$, and $m_n$ is the Stieltjes
transform of $\mu_{\Delta_n}$. We give without proof the following theorem, which is a generalization of
Theorem 2.



Theorem 5 Under the foregoing assumptions,

(i) There exists a unique pair of probability measures $(R^a, R^b) \in \mathcal{P}(\mathcal{H}) \times \mathcal{P}(\mathcal{H})$ such that for
all $z \in \mathbb{C}_+$,

\[
Y^a(z) \stackrel{d}{=} -\Big( z + \alpha(N^a + 1) + \sum_{i=1}^{N^a} Y_i^b(z) \Big)^{-1},
\]
\[
Y^b(z) \stackrel{d}{=} -\Big( z + \alpha(N^b + 1) + \sum_{i=1}^{N^b} Y_i^a(z) \Big)^{-1},
\]

where $N^a$ (resp. $N^b$) has distribution $F$ (resp. $G$) and $Y^a, Y_i^a$ (resp. $Y^b, Y_i^b$) are iid copies
with law $R^a$ (resp. $R^b$), and the variables $N^b, Y_i^a, N^a, Y_i^b$ are independent.

(ii) For all $z \in \mathbb{C}_+$, $m_n(z)$ converges as $n$ tends to infinity in $L^1$ to $p \, \mathbb{E}X^a(z) + (1 - p) \, \mathbb{E}X^b(z)$
where for all $z \in \mathbb{C}_+$,

\[
X^a(z) \stackrel{d}{=} -\Big( z + \alpha N_*^a + \sum_{i=1}^{N_*^a} Y_i^b(z) \Big)^{-1},
\]
\[
X^b(z) \stackrel{d}{=} -\Big( z + \alpha N_*^b + \sum_{i=1}^{N_*^b} Y_i^a(z) \Big)^{-1},
\]

where $N_*^a$ (resp. $N_*^b$) has distribution $F_*$ (resp. $G_*$) and $Y_i^a$ (resp. $Y_i^b$) are iid copies with
law $R^a$ (resp. $R^b$), independent of $N_*^b$ (resp. $N_*^a$).

In the case $\alpha = 0$ and for bi-regular graphs (i.e. BGWT with degree distribution $(\delta_k, \delta_l)$ and
parameter $p$), the limiting spectral measure is already known and was first derived by Godsil and
Mohar [18], see also Mizuno and Sato [27] for an alternative proof.



5.3 Uniform random trees 

The uniformly distributed tree on $[n]$ converges weakly to the Skeleton tree which is defined
as follows. Consider a sequence $T_0, T_1, \dots$ of independent GWT with offspring distribution the
Poisson distribution with intensity 1 and let $v_0, v_1, \dots$ denote their roots. Then add all the edges
$(v_i, v_{i+1})$ for $i \ge 0$. The distribution in $\mathcal{G}^*$ of the corresponding infinite tree is the Skeleton tree.
See [2] for further properties and Grimmett [19] for the original proof of the weak convergence
of the uniformly distributed tree on $[n]$ to the Skeleton tree $T_\infty$.

Let $\mu_{A(G_n)}$ denote the spectral measure of the adjacency matrix of the random spanning
tree $T_n$ on $[n]$ drawn uniformly (for simplicity of the statement, we restrict ourselves to the case
$\alpha = 0$). We denote by $m_n(z)$ the Stieltjes transform of $\mu_{A(G_n)}$.

As an application of Theorems 1 and 2, we have the following:

Theorem 6 Assume $\alpha = 0$.

(i) There exists a unique probability measure $R \in \mathcal{P}(\mathcal{H})$ such that for all $z \in \mathbb{C}_+$,

\[
X(z) \stackrel{d}{=} \big( W(z)^{-1} - X_1(z) \big)^{-1}, \tag{13}
\]

where $W = (A(T_0) - zI)^{-1} \in \mathcal{H}$ is the resolvent taken at the root of a GWT with offspring
distribution Poi(1), $X$ and $X_1$ have law $R$ and the variables $W$ and $X_1$ are independent.

(ii) For all $z \in \mathbb{C}_+$, $m_n(z)$ converges as $n$ tends to infinity in $L^1$ to $\mathbb{E}X(z)$.

Sketch of Proof. The sequence $T_n$ satisfies (R-A'), thus, from Theorem 1, it is sufficient to show
that the resolvent operator $R = (A - zI)^{-1}$ of the Skeleton tree taken at the root satisfies
the RDE (13). The distribution invariant structure of $T_\infty$ implies that $X(z) = R(z)_{v_1 v_1}$. The
number of offspring of the root $v_1$ of $T_1$ is a Poisson random variable with intensity 1, say $N$.
We denote the offspring of $v_1$ in $T_1$ by $v_1^1, \dots, v_1^N$. From the Schur decomposition formula, we
have

\[
R(z)_{v_1 v_1} = -\Big( z + \widetilde R(z)_{v_2 v_2} + \sum_{i=1}^{N} R^i(z)_{v_1^i v_1^i} \Big)^{-1},
\]

where $\widetilde R(z)$ is the resolvent of the infinite tree obtained by removing $T_1$ and $v_1$, and $R^i(z)$ is the
resolvent of the subtree of the descendants of $v_1^i$ in $T_1$. Now, by construction $\widetilde R(z)_{v_2 v_2}$ has the same
distribution as $R(z)_{v_1 v_1}$, and the $R^i(z)_{v_1^i v_1^i}$ are independent copies of $W(z)$, independent of $\widetilde R(z)$. Thus
we obtain

\[
X(z) \stackrel{d}{=} -\Big( z + X_1(z) + \sum_{i=1}^{N} W_i(z) \Big)^{-1} \tag{14}
\]
\[
= -\big( -W(z)^{-1} + X_1(z) \big)^{-1}, \tag{15}
\]

where in (15), we have applied (2) for a GWT with Poisson offspring distribution. The existence
and uniqueness of the solution of (13) follow from (14) using the same proof as in Lemma 2.2. □



Appendix 

For a proof of the next theorem, see e.g. Billingsley [6]. 

Theorem 7 (Skorokhod Representation Theorem) Let $\mu_n$, $n \in \mathbb{N}$, be a sequence of probability
measures on a complete metric separable space $S$. Suppose that $\mu_n$ converges weakly to
a probability measure $\mu$ on $S$. Then there exist random variables $X_n$, $X$ defined on a common
probability space $(\Omega, \mathcal{F}, \mathbb{P})$ such that $\mu_n$ is the distribution of $X_n$, $\mu$ is the distribution of $X$, and
for every $\omega \in \Omega$, $X_n(\omega)$ converges to $X(\omega)$.

The next lemma is a consequence of Lidskii's inequality. For a proof see Theorem 11.42 in 

® 

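Lemma 7.1 (Rank difference inequality) Let $A$, $B$ be two $n \times n$ Hermitian matrices with
empirical spectral measures $\mu_A = \frac{1}{n} \sum_{i=1}^{n} \delta_{\lambda_i(A)}$ and $\mu_B = \frac{1}{n} \sum_{i=1}^{n} \delta_{\lambda_i(B)}$. Then

\[
L(\mu_A, \mu_B) \le \frac{1}{n} \operatorname{rank}(A - B).
\]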
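
A quick numerical illustration of this inequality is sketched below. It is an assumption-based example, not part of the paper: it uses the uniform (Kolmogorov) distance between the two empirical distribution functions, which dominates the Lévy distance $L$, and the helper name `kolmogorov_distance` as well as the random test matrices are hypothetical choices.

```python
import numpy as np

def kolmogorov_distance(eig_a, eig_b):
    """sup_x |F_A(x) - F_B(x)| for two empirical spectral distribution functions.

    Both functions are right-continuous step functions, so the supremum is
    attained at one of the jump points collected in `grid`.
    """
    grid = np.sort(np.concatenate([eig_a, eig_b]))
    F_a = np.searchsorted(np.sort(eig_a), grid, side="right") / len(eig_a)
    F_b = np.searchsorted(np.sort(eig_b), grid, side="right") / len(eig_b)
    return float(np.max(np.abs(F_a - F_b)))

rng = np.random.default_rng(0)
n, r = 60, 3
G = rng.standard_normal((n, n))
A = (G + G.T) / np.sqrt(2 * n)           # real symmetric, hence Hermitian
U = rng.standard_normal((n, r))
B = A + U @ U.T                          # B - A has rank r
dist = kolmogorov_distance(np.linalg.eigvalsh(A), np.linalg.eigvalsh(B))
bound = r / n
```

In this sketch `dist` never exceeds `bound`, in line with the lemma, since the Lévy distance is at most the uniform distance.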

The next lemma is a standard tool of random matrix theory.

Lemma 7.2 (Schur formula) Let

\[
B = \begin{pmatrix} b & u^* \\ u & \widetilde B \end{pmatrix}
\]

denote an $n \times n$ Hermitian invertible matrix, $u$ being a vector of dimension $n - 1$. Then

\[
(B^{-1})_{11} = \big( b - u^* \widetilde B^{-1} u \big)^{-1}.
\]
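
The Schur formula is easy to check numerically. The sketch below is illustrative only: the function name `schur_corner` is hypothetical, and the lower-right block is assumed to be invertible, which holds for the generic random matrix used here.

```python
import numpy as np

def schur_corner(B):
    """(B^{-1})_{11} computed as (b - u* Btilde^{-1} u)^{-1}.

    B is assumed Hermitian, so its first row is (b, u*) and its first
    column is (b, u); Btilde is B with the first row and column removed.
    """
    b = B[0, 0]
    u = B[1:, 0]
    B_tilde = B[1:, 1:]
    return 1.0 / (b - np.conj(u) @ np.linalg.solve(B_tilde, u))

rng = np.random.default_rng(2)
G = rng.standard_normal((6, 6)) + 1j * rng.standard_normal((6, 6))
B = G + G.conj().T                       # random Hermitian matrix
corner_by_schur = schur_corner(B)
corner_by_inverse = np.linalg.inv(B)[0, 0]
```

The same function can be applied to $A - zI$ for a real symmetric $A$ and complex $z$, since then $u$ is real and $u^* = u^{\top}$; this is the form used in the proofs of Section 4.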



Lemma 7.3 Let $(F_*^{(n)})$, $n \in \mathbb{N}$, be a sequence of probability measures on $\mathbb{N}$ converging weakly to
$F_*$ such that $\sup_n \sum_k k F_*^{(n)}(k) < \infty$. Denote by $X^{(n)}$ and $X$ the variable defined by (3) with degree
distribution $F_*^{(n)}$ and $F_*$ respectively. Then, for the weak convergence on $\mathcal{H}$, $\lim_n X^{(n)} = X$.

Proof. Let $F_*$ and $F_*'$ be two probability measures on $\mathbb{N}$ with finite mean, and let $d_{TV}(F_*, F_*') =
\sup_{A \subset \mathbb{N}} \big| \int_A F_*(dx) - \int_A F_*'(dx) \big| = \frac{1}{2} \sum_k |F_*(k) - F_*'(k)|$ be the total variation distance. Let
$N_*, N_*', N, N'$ denote variables with law $F_*, F_*', F, F'$ respectively, and coupled so that $2\mathbb{P}(N_* \neq N_*')
= d_{TV}(F_*, F_*')$ and $2\mathbb{P}(N \neq N') = d_{TV}(F, F')$ (the existence of these variables is guaranteed
by the coupling inequality). We now reintroduce the distance defined in the proof of Lemma
2.2. Let $\Omega$ be a bounded open set in $\mathbb{C}_+$ with an empty intersection with the ball of center 0
and radius $\sqrt{\mathbb{E}N} + 1$. Let $\mathcal{P}(\mathcal{H})$ be the set of probability measures on $\mathcal{H}$. We define the distance
on $\mathcal{P}(\mathcal{H})$

\[
W(P, P') = \inf \mathbb{E} \int_\Omega |X(z) - X'(z)| \, dz,
\]

where the infimum is over all possible couplings of the distributions $P$ and $P'$, where $X$ has law
$P$ and $X'$ has law $P'$. With our assumptions, we may introduce the variables $X$ (with law
$P$) and $X'$ (with law $P'$) defined by (3) with degree distribution $F_*$ and $F_*'$ respectively.
The proof of the lemma will be complete if we prove that there exists $C$, not depending on $F_*$
and $F_*'$, such that

\[
W(P, P') \le C \max\big( d_{TV}(F_*, F_*'), \, d_{TV}(F, F') \big). \tag{16}
\]

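We denote by $Y$ (with law $Q$) and $Y'$ (with law $Q'$) the variables defined by (2) with offspring
distribution $F$ and $F'$, coupled so that $W(Q, Q') = \mathbb{E} \int_\Omega |Y(z) - Y'(z)| \, dz$. We consider $(Y_i, Y_i')_{i \in \mathbb{N}}$
iid copies of $(Y, Y')$, independent of the variable $N_*$. By definition, we have the following:

\[
W(P, P') \le \mathbb{E} \int_\Omega |X'(z) - X(z)| \, \mathbf{1}(N_* \neq N_*') \, dz
+ \mathbb{E} \int_\Omega \left| \Big( z + \alpha N_* + \sum_{i=1}^{N_*} Y_i(z) \Big)^{-1} - \Big( z + \alpha N_* + \sum_{i=1}^{N_*} Y_i'(z) \Big)^{-1} \right| dz
\]
\[
\le d_{TV}(F_*, F_*') \int_\Omega (\Im z)^{-1} \, dz + \int_\Omega (\Im z)^{-2} \, \mathbb{E} \sum_{i=1}^{N_*} |Y_i(z) - Y_i'(z)| \, dz
\]
\[
\le d_{TV}(F_*, F_*') \int_\Omega (\Im z)^{-1} \, dz + \mathbb{E}N_* \Big( \inf_{z \in \Omega} \Im z \Big)^{-2} W(Q, Q'). \tag{17}
\]

We then argue as in the proof of Lemma 2.2. Since $\Psi(Q) = Q$ and $\Psi'(Q') = Q'$ (where $\Psi'$ is
defined as $\Psi$ with the distribution $F'$ instead of $F$), we get:

\[
W(Q, Q') = W(\Psi(Q), \Psi'(Q'))
\]
\[
\le \mathbb{E} \int_\Omega |\psi(N, (Y_i); z) - \psi(N', (Y_i'); z)| \, dz
\]
\[
\le d_{TV}(F, F') \int_\Omega (\Im z)^{-1} \, dz + \mathbb{E} \int_\Omega |\psi(N, (Y_i); z) - \psi(N, (Y_i'); z)| \, dz
\]
\[
\le d_{TV}(F, F') \int_\Omega (\Im z)^{-1} \, dz + \mathbb{E}N \Big( \inf_{z \in \Omega} \Im z \Big)^{-2} W(Q, Q').
\]

Then since $\inf_{z \in \Omega} \Im z > \sqrt{\mathbb{E}N}$, we deduce that

\[
W(Q, Q') \le \Big( 1 - \mathbb{E}N \big( \inf_{z \in \Omega} \Im z \big)^{-2} \Big)^{-1} d_{TV}(F, F') \int_\Omega (\Im z)^{-1} \, dz.
\]

This last inequality, together with (17), implies (16). □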